Scaling Policy Positions From Coded Units of Political Texts∗
نویسندگان
چکیده
Applying a coding scheme to discrete text units has long been the most common method for estimating substantive quantities of interest about the authors of these texts, whether for political, social, economic, or other substantive reasons. In political analysis, researchers typically build scales of policy positions from the relative frequencies of text units coded as left versus right policy categories. In this paper we reexamine the theoretical and linguistic basis for such scales, proposing a new alternative based on the logarithm of odds-ratios that is consistent with this underlying political and linguistic mechanism. We contrast this scale to previous approaches using text units coded into political categories from the discipline’s longest-running content analysis dataset, that of the Comparative Manifesto Project (CMP). We demonstrate that the logit scale avoids widely acknowledged flaws in previous approaches and validate it through comparison to independent expert surveys of policy positions. Applied to existing CMP data, without requiring any estimation or inferential procedures, we show how to unlock more policy dimensions, for more years, than have ever been provided before, and we make this new dataset available along with estimates of uncertainty for each measure. Finally, we draw some lessons for the future design of coding schemes for political texts.
منابع مشابه
Scaling Policy Preferences from Coded Political Texts
Scholars wanting to estimate substantive quantities of interest, for example policy positions, from political texts typically apply a coding scheme to discrete text units such as words or sentences. Scales of policy positions, for example a left-right scale of economic policy, are typically built from the relative frequencies of text units coded into different categories. In this paper we reexa...
متن کاملA Dynamic State-Space Model of Coded Political Texts
This article presents a new method of reconstructing actors’ political positions from coded political texts. It is based on a model that combines a dynamic perspective on actors’ political positions with a probabilistic account of how these positions are translated into emphases of policy topics in political texts. In the article it is shown how model parameters can be estimated based on a maxi...
متن کاملEstimating Policy Positions from Political Texts
D eriving reliable and valid estimates of the policy positions of key actors is fundamental to the analysis of political competition. Various systematic methods have been used to do this, including surveys of voters, politicians, and political scientists, and the content analysis of policy documents. Each method has advantages and disadvantages but, for both theoretical and pragmatic reasons, p...
متن کاملExtracting Policy Positions from Political Texts Using Words as Data
We present a new way of extracting policy positions from political texts that treats texts not as discourses to be understood and interpreted but rather, as data in the form of words. We compare this approach to previous methods of text analysis and use it to replicate published estimates of the policy positions of political parties in Britain and Ireland, on both economic and social policy dim...
متن کاملEstimating Uncertainty in Quantitative Text Analysis∗
Several methods have now become popular in political science for scaling latent traits— usually left-right policy positions—from political texts. Following a great deal of development, application, and replication, we now have a fairly good understanding of the estimates produced by scaling models such as “Wordscores”, “Wordfish”, and other variants (i.e. Monroe and Maeda’s two-dimensional esti...
متن کامل